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Abstract. A data structure is presented for point location in connected planar subdivisions when the 
distribution of queries is known in advance. The data structure has an expected query time that is within 
a constant factor of optimal. More specifically, an algorithm is presented that preprocesses a connected 
planar subdivision G of size n and a query distribution D to produce a point location data structure for 
G. The expected number of point-line comparisons performed by this data structure, when the queries are 
distributed according to D, is H + 0(H 2 ^ 3 + 1) where H = H(G, D) is a lower bound on the expected 
number of point-line comparisons performed by any linear decision tree for point location in G under the 
query distribution D. The preprocessing algorithm runs in 0(n log n) time and produces a data structure 
of size 0(n). These results are obtained by creating a Steiner triangulation of G that has near- minimum 
entropy. 



1 Introduction 



The planar point location problem is the classic search problem in computational geometry. Given a planar 
subdivision G0 the planar point location problem asks us to construct a data structure so that, for any 
query point q, we can quickly determine which face of G contains 

The history of the planar point location problem parallels, in many ways, the history of binary 
search trees. After a few initial attempts [TTJ [T51 [^T] , asymptotically optimal (and quite different) linear- 
space O(logn) query time solutions to the planar point location problem were obtained by Kirkpatrick [17j . 
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1 A planar subdivision is a partitioning of the plane into points (called vertices), open line segments (call edges), and maximal 
connected 2-dimensional regions (called faces). 

2 In the degenerate case where q is a vertex or contained in an edge of G any face incident on that vertex/edge may be 
returned as an answer. 
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Sarnak and Tarjan [23], and Edelsbrunner et al. [12] in the mid 1980s. These results were based on hierarchical 
simplification, data structural persistence, and fractional cascading, respectively. All three of these techniques 
have subsequently found many other applications. An elegant randomized solution, combining aspects of all 
three previous solutions, was later given by Mulmuley [20], and uses randomized incremental construction, a 
technique that has since become pervasive in computational geometry [TUJ Section 9.5]. Preparata [22J gives 
a comprehensive survey of the results of this era. 

In the 1990s, several authors became interested in determining the exact constants achievable in 
the query time. Goodrich et al. [13j gave a linear-size data structure that, for any query, requires at most 
2 log n + o(logn) point-line comparisons and conjectured that this query time was optimal for linear-space 
data structures^ The following year, Adamy and Seidel [1] gave a linear-space data structure that answers 
queries using logn + 2^/log n + O(loglogn) point-line comparisons and showed that this result is optimal up 
to the third term. 

Still not done with the problem, several authors considered the point location problem under various 
assumptions about the query distribution. All these solutions compare the expected query time to the entropy 
bound; in a planar subdivision G with m faces F%, . . . , F m , if Pr(i r j) is the probability that q is contained 
in Fi, then no algorithm that makes only binary decisions can answer queries using an expected number of 
decisions that is fewer than 



In the previous results on planar point location, none of the query times are affected significantly by 
the structure of G; they hold for arbitrary planar subdivisions. However, when studying point location under 
a distribution assumption the problem becomes more complicated and the results become more specific. A 
connected subdivision is a planar subdivision whose underlying (vertex and edge) graph is connected. A 
convex subdivision is a planar subdivision whose faces are all convex polygons, except the outer face, which 
is the complement of a convex polygon. A triangulation is a convex subdivision in which each face has at 
most 3 edges on its boundary. 

Note that, if every face of G has a constant number of sides, then G can be augmented, by the addition 
of extra edges, so that it is a triangulation without increasing (JlJ by more than a constant. Similarly, in 
a convex subdivision G where the query distribution D is uniform within each face of G, the faces of the 
subdivision can be triangulated without increasing the entropy by more than a constant [5]. Thus, in the 
following we will simply refer to results about triangulations where it is understood that these also imply 
the same result for planar subdivisions with faces of constant size or convex subdivisions when the query 
distribution is uniform within each face. 

Arya et al. [2] gave two results for the case where the query point p is chosen from a known distribu- 
tion where the x and y coordinates of p are chosen independently and G is a convex subdivision. They gave 
an 0(n) space data structure for which the expected number of point-line comparisons is at most AH + 0(1) 
and an 0(n 2 ) space data structure for which the expected number of point- line comparisons is at most 
2H + 0(1). The assumption about the independence of the x and y coordinates of p is crucial to the these 
results. 

For arbitrary distributions that are known in advance, several results exist. Iacono [15] 116] showed 
that, for triangulations, a simple variant of Kirkpatrick's original point location structure gives a linear space, 
0(H + 1) expected query time data structure. Simultaneously, and independently, Arya et al. [5] showed 

3 Here and throughout, logarithms are implicitly base 2 unless otherwise specified. 
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that a variant of Mulmuley's randomized data structure also achieves 0(H + 1) expected query time. A 
sequence of papers by Arya et al. [H|4l[6] has recently culminated in an 0(n) space data structure for point 
location in triangulations with query time H + 0{H 1 / 2 + 1) [5J. 

In the current paper, we show that, for any connected planar subdivision, there exists a data structure 
of size 0(n) that can answer point location queries using H + 0(H 2 ^ 3 + 1) point/line comparisons. Here, 
H = H{G, D) is a lower bound on the expected cost of any linear decision tree that solves this problem. Note 
that H is often greater than the quantity H defined above and this is necessarily so. To see this, consider 
that the problem of testing whether a query point is contained in a simple polygon P with n vertices is a 
special case of planar point location in a connected planar subdivision. However, in this special case the 
subdivision only has 2 faces, so H < 1. It seems unlikely that, for any simple polygon P and any probability 
measure D over R 2 , it is always possible to test in O(l) expected time if a point p drawn from D is contained 
in P. Indeed, it is not hard to design a convex polygon P and distribution D so that the expected cost of 
any algebraic decision tree for point location in P, under query distribution D, is fi(logn). 

Note that all known algorithms for planar point location that do not place special restrictions on the 
input subdivision can be described in the linear decision tree model of computation^ The data structures 
presented in the current paper are the most general results known about planar point location and imply, to 
within a lower order term, all of the results discussed in the introduction. 

We achieve our results by showing how to compute a Steiner triangulation A = A(G, D) of G that 
has nearly minimum entropy over all possible triangulations of G and then proving that the entropy of a 
minimum-entropy Steiner triangulation of G is a lower bound on the cost of any linear decision tree for point 
location in G. By then applying the recent result of Arya et al. to the Steiner triangulation A we obtain 
upper and lower bounds that match to within a lower-order term. 

A preliminary version of this paper, which dealt only with convex subdivisions, has appeared in the 
Proceedings of the 19th ACM-SIAM Symposium on Discrete Algorithms (SODA 2008) [9]. 

The remainder of this paper is organized as follows: Section [2] presents definitions and notations 
used throughout the paper. Section [3] shows how to compute a near-minimum-entropy triangulation of a 
simple polygon. Finally, Section H applies these tools to obtain our point location structure for connected 
planar subdivisions. 



2 Preliminaries 



In this section we give definitions, notation, and background required in subsequent sections. 



Interiors and Boundaries. For a set P C R 2 , we denote the boundary of P by dP and the interior of P 
by int(P). The closure of P is denoted by clo(P) = P U dP. 



Triangles and Convex Polygons. For the purposes of this paper, a triangle is the common intersection 
of at most 3 closed halfplanes. This includes triangles with infinite area and triangles having 0, 1, 2, or 3, 

4 Although significant breakthroughs have recently been made in this area [8l 123] . we deliberately do not survey algorithms 
that require the vertices of the subdivision to be on integer coordinates. 
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vertices. Similarly, a convex k-gon is the common intersection of at most k closed halfplanes. 

For a closed region X C R 2 , a triangulation of X is a set of triangles whose interiors are pairwise 
disjoint and whose union is X. We use the convention that, unless X is explicitly mentioned, the triangulation 
in question is a triangulation of R 2 . This definition of a triangulation is often referred to as a Steiner 
triangulation since it allows vertices of the triangles to be anywhere in X, and not at some finite predefined 
set of locations. 



Simple Polygons, Pseudotriangles, and Geodesic Triangles. A (near-simple) polygon P is a closed 
subset of R 2 whose boundary is piecewise linear and such that int(P) is homeomorphic to an open disk. 
Note that this definition of a polygon implies that every bounded face of a connected planar subdivision is 
a polygon. Also, triangles, as defined above, are polygons. Note that near-simple polygons are slightly more 
general than simple polygons, for which dP is a simple closed curve. However, our definition is sufficiently 
close that algorithms designed for simple polygons continue to work with near-simple polygons. 

A reflex chain in a polygon P is a consecutive sequence of vertices Pi, . . . ,Pj of P, where the internal 
angle at pk is at least tt, for all k £ {i + 1, . . . , j — 1}. A pseudotriangle is a polygon whose boundary consists 
of 3 reflex chains. An i-convex pseudotriangle (i £ {0, 1,2,3}) is a pseudotriangle in which i of the reflex 
chains consist of single line segments. 

A shortest path between points a, b G P, denoted abp is a curve of minimum length that is contained 
in P and that has a and b as endpoints. For 3 points, a, b, c € P, a geodesic triangle in P, denoted Apabc 
is the union of all shortest paths of the form xcp, where x £ abp. Geodesic triangles are closely related to 
pseudotriangles. In particular, every geodesic triangle t consists of a pseudotriangle t and three paths joining 
the three convex vertices of t to a, 6, and c. 



Classification Problems and Classification Trees. A classification problem over a domain I? is a 
function V : D t— > {0, . . . , k — 1}. A <i-ary classification tree is a full d-aiy tree@ in which each internal node 

v is labelled with a function P v : T> >— > {0, , d — 1} and for which each leaf £ is labelled with a value 

d(£) £ {0, . . . , k — 1}. The search path of an input p in a classification tree T starts at the root of T and, at 
each internal node v, evaluates i — P v (p) and proceeds to the zth child of v. We denote by T(p) the label of 
the final (leaf) node in the search path for p. We say that the classification tree T solves the classification 
problem V over the domain T> if, for every p £ T>, V{p) — T(p). 

In this paper, we are especially concerned with linear decision trees. These are binary classification 
trees for a problem V over the domain IR 2 . Each internal node v of a linear decision tree contains a linear 
inequality P v {x, y) = ax + by > c, and the node evaluates to 1 or depending on whether the query point 
(x,y) satisfies the inequality or not, respectively. Geometrically, each internal node of T is labelled with a 
directed line and the decision to go to the left or right child depends on whether p is to the left or right (or 
on) this line. An immediate consequence of this is that, for each leaf £ of T, the closure of r(£) is a convex 
polygon. 



Probability. Throughout this paper D is a probability measure over R 2 that represents the query dis- 
tribution. The notation Pr(A) denotes the probability of event X under the probability measure D. The 

5 A full d-ary tree is a rooted ordered tree in which each non-leaf node has exactly d children. 
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notation Pr(Y\X) denotes the conditional probability of Y given X, i.e., Pr(Y\X) = Pr(Y~ n X) / Pr(X). For 
any set S 1 , we use the shorthand US' to denote U s gs s - 

For a set S of subsets of R 2 , we define the induced entropy of S, denoted by H(S) as H(S) = 
X; seS Pr(s|US')log(l/Pr(s|US')). For two sets Si,S 2 C R 2 with USi = U5 2 , the joint entropy of 5i and S 2 , is 
H(S U S 2 ) =E sieSi E S2e s 2 Pr(sins2)log(l/Pr(sins 2 )). It is well-known that if (S u S 2 ) < H(Si)+H(S 2 ) 
(see, for example, Gray [TH Lemma 2.3.2]). 

We will sometimes abuse terminology slightly by referring to a triangulation A of X as a partition 
of X into triangles, although strictly speaking this is not true since the triangles in A are closed sets that 
overlap at their boundaries. We will then continue the abuse by computing the induced entropy of A. This 
introduces a technical difficulty in that EteA P r (0 — 1 anc ^ inequality is possible if there exists sets Y C R 2 
such that the area of Y is and Pr(Y) > 0. To avoid this technical difficulty, we will assume that D is nice 
in the sense that, if the area of Y is then Pr(Y) = 0. This implies that, for every t in A Pr(t) = Pr(int(i)). 
This assumption will avoid lengthy technical but uninteresting cases in our analysis. In practice, this problem 
can be avoided by using a symbolic perturbation of the query point. 

The probability measures used in this paper are usually defined over R 2 . We make no assumptions 
about how these measures are represented, but we assume that an algorithm can, in constant time, perform 
each of the following two operations: 

1. given a triangle t, compute Pr(i), and 

2. given a triangle t and a point x at the intersection of two of t's supporting lines, compute a line i that 
contains x and that partitions t into two open triangles to and t\ such that Pr(i ) < Pr(<i) < Pr(£)/2. 

Requirement 2 is used only for convenience in describing our data structure. It is not strictly necessary, but 
its use greatly simplifies the exposition of our results. To eliminate requirement 2, one can use the same 
method described by Collette et al. Section 5] . 

For a classification tree T that solves a problem P : T> i— > {0, . . . , k — 1} and a probability measure 
D over D, the expected search time of T is the expected length of the search path for p when p is drawn at 
random from V according to D. Note that, for each leaf i of T there is a maximal subset r(£) C T> such that 
the search path for any p € r{£) ends at I. Thus, the expected search time of T (under distribution D) can 
be written as 

M r )= Pr(r^)) x depth(£) , 

e<EL{T) 

where L(T) denotes the leaves of T and depth(£) denotes the length of the path from the root of T to i. For 
any tree T we use V(T) to denote the vertices of T. 

The following theorem, which is a restatement of (half of) Shannon's Fundamental Theorem for a 
Noiseless Channel {25j Theorem 9] , is what all previous results on distribution-sensitive planar point location 
use to establish their optimality: 

Theorem 1 (Fundamental Theorem for a Noiseless Channel). Let V : T> {0, . . . , k— 1} be a classification 
problem and let p £ T> be selected from a distribution D such that Pt{V(j>) = i} = Pi, for < i < k. Then, 
any d-ary classification tree T that solves V has 

fc-i 

MT) >5>ilog d (l/ Pi ) . (2) 

i=0 
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Figure 1: The geodesic triangle t = ApiPjPk partitions P into several pieces Pi, ... , P m . 

Theorem[T]is typically applied to the point location problem by treating point location as the problem 
of classifying the query point p based on which face of G contains it. In this way, we obtain the lower bound 
in Q). 



3 Minimum Entropy Triangulations 

Let P be a simple polygon with n vertices, denoted po, . . . ,p n -i as they occur, in counterclockwise order, 
on the boundary of P. We will show how to find a triangulation of P that has near-minimum entropy. That 
is, we will find a triangulation A = A(P, D) such that H(A) is near-minimum over all triangulations of P. 
In order to shorten the formulas in this section, we will implicitly condition the distribution D on P. More 
precisely, throughout this section the notation Pt(X) should be treated as shorthand for Pr(X\P). 



3.1 The Triangulation A = A(P,D) 

Our triangulation algorithm is recursive and takes as input a polygon P and a reflex chain p^, . . . ,pj on the 
boundary of P. If P is a triangle, then there is nothing to do, so the algorithm outputs P and terminates. 
Otherwise, the algorithm first selects a point pk on the boundary of P and adds all the edges of the geodesic 
triangle t = ApiPjp^ to the triangulation A. Observe that removing t from P disconnects P into components 
Pi, ... , P m where clo(Pi) is a polygon that shares a reflex chain Cj with the pseudotriangle t (see Figurc[TJ). 
The point pk is selected in such a way that, for all i G {1, . . . ,m}, Pr(Pi) < (l/2)Pr(P)o Each of the 
sub-polygons P\ , . . . , P m can then be triangulated recursively by applying the algorithm to Pi and the reflex 
chain Q. 

To complete the triangulation A all that remains is to partition t — c\o(t \ dt) into triangles. To do 
this, we first partition t into at most one triangle t' and three 2-convex pseudotriangles to,ii,i2 as shown 
in Figured] a. Let Qt be the connected component of (int(P) \ t) U t i that contains ij. To complete the 
triangulation we will partition ti into triangles, for each i G {0,1,2}, using a recursive algorithm. This 
algorithm selects an edge of the reflex chain in ti and extends in both directions until it reaches the 
boundary of ti (see Figure Ob). The resulting line segment partitions ti into a triangle t\, and two 2-convex 

6 The existence of such a point is readily established by a standard continuity argument; see Bose et al. [7] for an example. 



G 



(a) 



(b) 



(c) 



Figure 2: Partitioning (a) a pseudotriangle i into three 2-convex pseudotriangles to, H, t% and one triangle t 
(b) a 2-convex pseudotriangle £j into one triangle ^ and two 2-convex pseudotriangles t^o and f^i, and (c) 
Qi into 4 pieces. 



Figure 3: The triangles obtained during the first level of recursive triangulation. The yellow subpolygons 
are triangulated recursively. 

pseudotriangles t^o and t^i that are triangulated recursively. At the same time, Qi is partitioned into up to 
4 pieces (see Figure (2]c): 

1. the triangle t' i: and 

2. a subpolygon Pj incident to ej, 

3. The two connected components Qi$ and Q^i of Qi \ t[ that contain tj.o and i,-^, respectively. 

The edge ej is selected so that Pr(Q a ) < (l/2)Pr(Qi) for each b e {0, 1}Q This completes the description 
of the triangulation A. A partially completed triangulation is show in Figure [3J 

7 The existence of such an edge ej is assured by yet another continuity argument. 
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3.2 The A- Tree T = T{P, D) 



In order to study the entropy of the triangulation A denned above, we will impose a tree structure on 
the pieces of P induced by the triangles in A. The A-tree T = T(P, D) for P is a tree whose nodes are 
subpolygons of P and which has the property that, for any node y that is the child of a node x, y C x. 

The tree T has three different kinds of nodes, called P-nodes, T-nodes, and Q- nodes. The root r 
of T is the polygon P and is a P-node. The root of T has the following children (defined in terms of the 
construction algorithm in the previous section; see Figure [4]): 

1. Each subpolygon P, whose boundary does not share a segment with t is a child of r and is a P-nodc. 

2. The subpolygon Q — t U Qq U Q\ U Q2 is a child of r and is called a T-node. 

The subpolygon Q has three children Qo,Qi,Q2 that are called Q-nodes. The subtree rooted at Qi 
is a ternary tree corresponding to the recursive partitioning of and Qj done by the algorithm. The leaves 
of this subtree are P-nodes and the internal nodes of this subtree are Q-nodes. Each internal node has up 
to 3 children, up to 1 of which may be a P-node corresponding to a subpolygon Pj and up to two of which 
may be Q-nodes. 

Note that the above definition yields a tree whose leaves are P-nodes that correspond to the subpoly- 
gons Pi, ... , P m obtained by removing t from P. The subtree rooted at each such leaf is obtained recursively 
from the recursive triangulation of Pi . 

Now that we have defined the tree T, we study some of its properties. Our first lemma says that T 
does a good job of splitting P based on its probabilities. 

Lemma 1. Let P be a polygon, let D be a probability measure over R 2 , and let T = T(P, D) be the A-tree 
for (P,D). Let x be a node of T whose depth is i. Then Pr(x) < (l/2^/ 4 J ) Pr(P). 

Proof. Assume i > 4, otherwise there is nothing to prove. Let r' be the fourth node on the path from 
the root, r, of T to x. Let P r i be the path in T from r to r' . If P r < has at least two P-nodes, then 
Pr(r') < (l/2)Pr(r) = (l/2)Pr(P). Otherwise, the second node in P r , is a T-node followed by 2 Q- 
nodes. By construction, for any Q-node y whose parent is a Q-node z, Pr(y) < (l/2)Pr(z). Therefore, 
Pr(r') < (1/2) Pr(parent(r')) < (1/2) Pr(r) = (1/2) Pr(P). If x = r', then i = 4 and the proof is complete. 
Otherwise, apply the same argument inductively on the path from r' to x, to obtain 

Pr(s) < Pr(r') • 1/2 L( ^ 4)/4J = Pr(r') • 1/2 L,;/4J - 1 < (l/2 Ll/4j ) Pr(r) = (l/2 Li/4J ) Pr(P) , 

and this completes the proof. □ 

Our next lemma says that a single line segment does not intersect very many high probability 
triangles in A. 

Lemma 2. Let P be a polygon, let D be a probability measure over R 2 , and let T = T(P, D) be the A-tree for 
(P, D), let s C P be a line segment, and let Si C V(T) be the set of all vertices x € V(T) that are distance 
at most i from the root ofT and such that int(x) intersect s. Then \Si\ < 6i 2 
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Figure 4: The A-tree T. Yellow leaves in this tree are the root of subtrees obtained recursively. Grey areas 
show the portions of a node not covered by its children. A black edge from a node x to a node y indicates 
that Pr(y) < (1/2) Pr(a;). A solid edge leading into a node x indicates that x is separated from the rest of 
P by a shortest path in P. 9 



Proof. There are 3 types of nodes in T whose interiors intersect s: Type 1 nodes contain one endpoint of s 
in their interior, Type 2 nodes contain both endpoints of s in their interior, and Type nodes contain no 
endpoints of s in their interior. Notice that each level of T contains at most 1 Type 2 node or 2 Type 1 
nodes, so the total number of Type 1 and Type 2 nodes at distance at most i from the root is at most 2i + 1. 
Thus, all that remains is to bound the number of Type nodes whose distance from the root is at most i. 

Let w be any P-node such that int(w) does not contain either endpoint of s. Since w is a P-node, 
there is a reflex chain C w on the boundary of w that is a shortest path between two points on the boundary 
of P, and every path in P from mt(w) to s intersects C w . Stated another way, the interior of a P-node does 
not intersect s unless it contains at least one endpoint of s. Therefore, all Type nodes arc either T-nodes 
or Q-nodes. 

For every Type node x, there is a path P x in T from x to a T-node that is adjacent to a Type 1 or 
Type 2 node. Furthermore, the path P x consists of x followed by or more Q-nodes, and terminates with a 
T-node. Looking more closely at the definition of Q-nodes, we see that two sibling Q-nodes x and y are not 
mutually visible, i.e., there is no line segment sCP that intersects both int(x) and int(y). 

All of this implies that each of the at most 2i + 1 Type 1 or Type 2 nodes is adjacent to at most 1 
Type T-node, and this T-node is the endpoint of at most 3 paths of Type Q-nodes. Each such path is 
of length at most i. Therefore, the total number of nodes in T that intersect s is at most 2i ■ 3i = 6i 2 . □ 



3.3 Minimum-Entropy Triangulation 

Next, we show that the triangulation A defined above is nearly-minimum entropy over all possible triangu- 
lations of P. Wc do this by developing a technique for lower-bounding the entropy of one triangulation in 
terms of the entropy of another triangulation. We then show how to apply this technique to lower bound 
the entropy of any triangulation A* in terms of the entropy of A. 

To obtain lower bounds on the entropy of a triangulation A*, consider the following easily proven 
observation: If each triangle in A* intersects at most c triangles of some triangulation A then H(A*) > 
ff(A)-logcH This observation allows us to use A to prove a lower bound on the entropy of a triangulation 
A*. Unfortunately, the condition that each triangle of A* intersect at most c triangles of A is too restrictive 
for our purposes. Instead, we require following stronger result: 

Lemma 3. Let D be a probability measure over R 2 . Let A and A* be triangulations, and let {Ai, . . . , A m } 
be a partition of the triangles in A. Suppose that, for all i £ {1, . . . , to} and for each triangle t* £ A*, t* 
intersects at most a triangles in Ai. Then 

m 

H(A) <ff(A*) + ff({UAi,...,UA m }) + ^Pr(UA i )logc i . 

i=l 

Intuitively, Lemma [3] can be thought of as follows: If we tell an observer which of the Ai a point 
p drawn according to D occurs in then the amount of information we are giving the observer about the 
experiment is at most i/({UAi, . . . , UA m }). However, after giving away this information, we are able to 
apply the simple observation in the previous paragraph, since each triangle in A* intersects at most Cj 
elements of each Aj. Thus, Lemma [3] is really just m applications of the simple observation. The following 
proof formalizes this: 

8 Proof: Consider the set X = {t* (It : t* £ A*, t £ A}. Each triangle of A* contributes at most c pieces to X, so we have 
H(A) < H(X) < H(A*) + logc. 
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Proof. 

fr(A*)+£r({UAi,...,UA m }) > ff(A*,{uA 1 ,...,UA„J) 

m 

= 1212 Pr(i*n A)io g (i/Pr(rnAO 



i=l t*GA* 



> EE E (Pr(i*nt)log(l/Pr(t*nt)-lo gCi ) 

i=l teAi t* GA* 
m m 

> EE Pr W lo ^y Pr W) - E Pr ( uA *) lo s c « 

i=l iGAi i=l 
m 

= ff(A)-EPr(UA i )logc 4 , 

i=i 

and this completes the proof. □ 

The remainder of our argument involves partitioning the triangles of A into subsets Ai, . . . , A m 
and then showing that H(Ai, . . . , A m ) and YITLi Pr(UAi) logCj are not too big. To help us, we will use the 
A-tree T. For a node x in T with children x\, . . . , Xk, let = a;\ (Ui=i x i) be the portion of x not covered 
by x's children. Note that t(x) is always either the empty set or is a triangle in A (see Figured]). In fact, 
for every triangle t £ A, there is exactly one x £ V(T) such that t(x) — t, and for every x £ V(T) such that 
t(x) is non-empty there is exactly one t £ A such that t(x) = t. This implies thai@ 

h(a) - e Pr W lo g(v Pr ( f )) = E Pr W x )) lo s(v Pr (*(^))) • 

For a node a; £ V(T), we define Pr(x) = Pr(t(x)) is the probability that a point drawn from D is contained 
in t{x). 

Next we apply Lemma [3] to obtain a lower bound on the entropy of any triangulation A* . 

Lemma 4. Let P be a simple polygon, let D be a probability measure over R 2 , and consider the triangulation 
A = A(P,D). Then, for any triangulation A* of P, 

H(A) < H{A*) + 0(H(A*) 2 ^ 3 + 1) . 



Proof. Let T = T(P, D) be the A-tree for (P, D). Partition the nodes of T into groups G\, G%, . . . where 

d = {x £ V(T) :l/2 l < Pr(x) < 1/2^} . 

In the following we will fix a value a, < a < 1, to be defined later. A group Gi is large if it contains at 
least 2 m elements, otherwise Gi is small. Let I + denote the index set of the large groups, i.e., I + = {i £ N : 
\d\ > 2 m }. Let I~ = N \ /+ be the index set of the small groups. 

Note that, for any group Gi, Lemma Q] ensures that all elements of Gi have depth at most Ai in T. 
Therefore, Lemma [D ensures that any triangle of A* intersects at most 3 x 6 x (4i) 2 = 288i 2 triangles of Gi. 
Therefore, applying Lemma [3] with a = 288i 2 , we obtain: 

oo 

H{A) < H(A*) + H({UG t . : i G N}) + E Pr ( uG! i) log(288i 2 ) . (3) 

i=l 

9 Here, and throughout the remainder, we slightly abuse notation by using the convention that ■ log(l/0) = 0. 
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Thus, all that remains is to bound the contribution of the last two terms on the right hand side of First, 



^Pr(UG,)log 



= E E Pr W lo s( 288z2 ) 

»=1 t£Gi 



^ ££Pr(i)(0(l)+loglog(l/Pr(t))) 

i=i ted 

= £Pr(t)(0(l)+loglog(l/Pr(f))) 



teA 

= 0(1 + log H (A)) , 

where the last equality follows from Jensen's Inequality. Finally, we show that the contribution of H 
H({U{G l } : i e N}) is at most 0(iJ(A) 2 / 3 ). 

H = H({U{Gi} : i € N}) 

OO 

= 5^Pr(UGi)log(l/Pr(UGi)) 
i=l 

= E Pr(UG,)log(l/Pr(UGO 

iei+ 

< E Pr(UG l )log(l/Pr(UG l ) 

iel+ 

< E Pr(UG01og(l/Pr(UG0 

iei+ 

= E Pr(UG i )log(l/Pr(UG i ) 
= E Pr(UG 4 )log(l/Pr(UG 4 ) 

i£l+ 

= E Pr(UG l )log(l/Pr(UG l ) 



< E Pr(UG 4 )log(l/Pr(UGO 



E PriUGOlogCl/PrCuGi)) 
E 2 Q V2 l - 1 log(2 l ) 

oo 

^ 2 ai /2 i ~ 1 log(2*') 

i=l 
oo 

Ei2 a V2 i - 1 

oo 

2-EV2 (1_Q)t 



2 ■ 



(1/2) 1 



(1 - (1/2)1-°)* 
+ 0(1/(1 -a) 2 ) , 



where the last equality is obtained using the Taylor series expansion for e x to obtain the inequality 1— 1/2 X > 
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x\n2 — (x 2 ln2)/2 for x close to 0. Continuing, we get 

H < Pr(UG,)log(l/Pr(UG,)) + 0(1/(1 -a) 2 ) 

< Yl Pr ( UG *) lo g( 2 V|G 4 |) + 0(1/(1 - a) 2 ) 

< ^ Pr(UGi) log(2V2 m ) + 0(1/(1 - a) 2 ) 
= ]T Pr(UG i )(l-a^ + 0(1/(1 -a) 2 ) 

= (1-a) 53 Pr(UG,;)« + 0(1/(1 -a) 2 ) 

= (1 - a) ^ Pr(UG,) log(2 l ) + 0(1/(1 - a) 2 ) 

= (1 - «) E E Pr W lQ g( 21 ) + - «) 2 ) 

= (! - «) E E Pr (*) mv Pr (*)) + °( 1 + v(i - «) 2 ) 

< (l-a)ff(A)+0(l + l/(l~a) 2 ) 

< 0{H{A) 2 ^ 3 + 1) 

Where the last inequality is obtained by setting a = 1 — l/i^A) 1 / 3 . Thus, we have shown that 

H(A) < H(A*) + 0(H(A) 2/3 + 1) , (4) 

which implies that H(A) = 0(H(A)* + 1). Applying this to the right hand side of (4} yields H(A) < 
H(A*) + 0(H(A*) 2 / 3 + 1), completing the proof. □ 

Lemma |4] shows that the triangulation A = A(P, D) defined previously is nearly minimum-entropy 
over all triangulations of P. The following theorem gives an algorithmic version of Lemma |4] 

Theorem 2. Let P be a simple polygon with n vertices, and let D be a probability measure over R 2 . Then 
there exists an O(nlogrt) time algorithm that computes a triangulation A' of P having 0{n) triangles and 
such that, for any triangulation A* of P , 

H(A') < H(A*) + 0{H{A*) 2 ' 3 + 1) . 

Proof. We show how the construction of the triangulation A described in Section 13.11 can be modified to run 
in O(nlogn) time. When constructing A the first step is to find the third vertex pk of the geodesic triangle 
t = ApiPjPk- This can be accomplished in 0(n) time by computing the shortest path trees from pi and pj 
to all other vertices of P and using these to find pk- For an example of a similar computation, see Bose et 
al. Section 2.2]. 

Next, t is split into three 2-convex pseudotriangles to,t±,t2, which is easily accomplished in O(n) 
time. The last step, before recursing, is to triangulate each of to,ti,t2- This step can be accomplished in 
O(n) time using a 2-sided exponential searching trick that was used by Mehlhorn |19j in the construction of 
biased binary search trees (see also, Collette et al. Theorem 1]). 
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Finally, the algorithm recurses on each of the pieces Pi, ... , P m . In this way, we obtain a divide- and- 
conquer algorithm for constructing A. Unfortunately, this algorithm may have running time Q,{n 2 ) since 
there is no bound significantly smaller than n on the size of an individual subproblem Pj. To overcome this, 
before recursing on a subproblem Pi we check if it contains more than n/2 vertices. If so, then rather than 
recursing normally on Pj we choose a geodesic triangle t* , one of whose sides is the reflex chain C\ and such 
that removing t* from Pj leaves a set of subpolygons P^i, . . . , Pj )mi each with at most n/2 vertices. This 
modification then yields an algorithm whose recursion tree has depth 0(log n) and at which the work done 
at each level is 0(n), so the total running time of this algorithm is 0(n\ogn). 

Note that this algorithm yields a triangulation A' that is different from A. In particular, there may 
exist one Pij with Pr(Pij) > Pr(Pj)/2. Despite this, all the proofs of Lemmas HHH continue to hold almost 
without modification. The only difference occurs in Lemma [H which now only guarantees a bound of 2 L Z / 8 J 
on the number of black edges, but this has almost no effect on subsequent computations. 

Finally, to see that A' contains 0(n) triangles, we count the different types of edges used in the 
triangulation A'. Some of these edges are edges of P, of which there are at most n. Some of these edges 
are edges of geodesic triangles, which always connect two vertices of P and do not cross each other, so there 
are at most n — 2 of these. The remaining edges are used to triangulate the interiors of pseudotriangles. 
A pseudotriangle that has k vertices is triangulated using 3 + 2(k — 3) edges. Since the total number of 
vertices in all pseudotriangles is at most 2n, this means that there are at most 6n edges used to triangulate 
pseudotriangles. Therefore, the total number of edges used by triangles in A', and hence the number of 
triangles in A', is O(n). □ 



4 Point Location in Simple Planar Subdivisions 

Next we consider the problem of point location in simple subdivisions. The following theorem of Arya et 
al. [6] shows that a low entropy triangulation can be used to make a good point location structure. 

Theorem 3 (Arya et al. 2007). Let D be a probability measure over R 2 and let A be a triangulation of 
R 2 having a total of n triangles. Then there exists a data structure of size 0{n) that can be constructed in 
O(nlogn) time, and for which the expected number of point /line comparisons required to locate the face of 
G containing a query point p, drawn according to D, is if (A) + 0(H(A) 1 ^ 2 + 1). 

The following lemma shows that the entropy of a minimum-entropy triangulation gives a lower bound 
on the cost of any point location structure. 

Lemma 5. Let T* be any linear decision tree for a classification problem V over R 2 . Then there exists a 
linear decision tree T' for V , such that, for each leaf I of T' , clo(r(£)) is a triangle and T' satisifies 

MT') <li D {T*) + 0{\o gl i D {T*)) 

for any probability measure D over R 2 . 

Proof. Each leaf £ of T* has a region r(£) that is a convex polygon. If r(£) has k sides then the depth of £ 
in T is at least k. To obtain the tree T' replace each such leaf £ of T* by a balanced binary tree of depth 
O(logfc) by repeatedly splitting the leaf into two children £\ and £ 2 whose regions have [~(fc + 2)/2] and 
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[(k + 2)/2j vertices. For a leaf I £ L(T*), let s{£) denote the set of leaves in T" in the subtree of I. Then 





Pr(r(f)) • depth(€) 








Prfrf/ll • denthW 








2 y Pr(r(£ )) • (depth(£ ) — G> (log (depth (t)))) 


££L(T«) 




= Mr') 


J] Pr(r(f )) ■ 0(log(depth(£))) 




££L(T*)e>€s(£) 




53 Pr(r(*)) • O(log(depth(0)) 




eeL(T') 


> Mt') 


-0(log(/x^(T*)) , 



where the last inequality is an application of Jensen's Inequality. □ 

Lemma[S]says that for any linear decision tree for point location, there is an underlying triangulation. 
The entropy of this triangulation gives a lower bound on the cost of the decision tree. Thus, the entropy of 
a minimum entropy triangulation gives a lower bound on the expected cost of any linear decision tree for 
point location. 

Keeping the above in mind, our point location structure is simple. Let G be a connected planar 
subdivision whose faces are F = {F\, . . . , F m } and let D be a probability measure over R 2 . We assume, 
without loss of generality that the outer face of G is the complement of a triangle, since otherwise we can 
add at most 3 vertices and 4 edges to G to make this true. Adding these edges will not increase the entropy 
the minimum weight triangulation of G by more than a constant. With this assumption, testing if the query 
point is in the outer face of G can be done using 3 linear comparisons after which we may safely assume that 
the query point is contained in an internal face of G. 

We triangulate each internal face Fi of G (a near-simple polygon) using Theorem [2] to obtain a 
triangulation Aj. The union of all Aj is a triangulation A of R 2 , to which we apply Theorem [3] to obtain a 
point location structure R = R(G, D) for point location in A and hence also in G. The following theorem 
shows that R is nearly optimal: 

Theorem 4. Given a connected planar subdivision G with n vertices and a probability measure D over R 2 , 
a data structure R = R(G,D) of size O(n) can be constructed in O(nlogn) time that answers point location 
queries in G. The expected number of point/line comparisons performed by R, for a point p drawn according 
to D is 

MR)<Vd(T*) + 0(hd(T*) 2 / 3 + 1) , 
where T* is any linear classification tree that answers point location queries in G. 



Proof. The space and preprocessing requirements follow from Theorem [5] and Theorem [3] To prove the 
bound on the expected query time, apply Lemma [5] to the tree T* and consider the resulting tree T", each 
of whose leaves have regions that are triangles and such that 

MT') < MT*) + 0(logMT*)) • (5) 
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Observe that each leaf of X" corresponds to a triangle in R 2 that is completely contained in one of the faces 
of G. Let A' denote this set of triangles and let A[ denote the subset of A' contained in F{ . Consider the 
entropy H(A') of the distribution induced by the leaves of X": 

m 

/f(A') = ^^Pr(i)log(l/Pr(i)) 
i=l teA'. 

m 

= E Pr ^) E Pr(t|i ? i)log(l/Pr(t)) 
i=i teAr 

m 

= E Pr ^) E Pr W^) ( lo g(v Pr (*i^)) - lo s( Pr (^))) (6) 

i=l teA'. 

m m 

= E Pr ^) E Pr M F <) M 1 / Pr (*i^)) + E Pr ^) ^(v Pr (^)) 

m 

= ^Pr(F i )/f(A9+ J ff(F) . 

i=l 

Similarly, the entropy of A is given by 

m 

if (A) = ££Pr(t)log(l/Pr(t)) 

z=i teAi 

m 

= 2 Pr(Fi) £ Pr(t|F0 log(l/ Pr(t|J})) + 

i=l teAi 
m 

= ^Pr(F)i?(A,) + W) • 

By Theorem [2J the triangles in Ai form a nearly- minimum entropy triangulation of Fi. More specifically, 

If (A 8 ) < H(A' i ) + 0(i?(A-) 2 / 3 + 1) . (7) 
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Putting this all together, we have 
ri i 

H(A) = ]T Pr(Fi)H (A,) + H(F) 

2=1 

m 

< Pr(F,)(if(^) + 0(ff(A^) 2/3 + 1)) + H{F) (by ©) 

m 

= H(A') + Pr(F t )0(H(Atf/ 3 + 1) (by ©) 

/ m \ 2 / 3 

= H(A') + I ^ Pr(F l )0(7J(A' J ) I + 0(1) (by Jensen's Inequality) 



/ \ 2/3 

= H(A , )+lo(l)-jrPr(F i ) Pr^lFOloga/Pr^'lF,)) +0(1) 
\ i=l t'eAj J 

( \ 2/3 

= H(A') + [ O(l) • £ £ Pr(f ) log(Pr(F,)/ Pr(i')) +0(1) 
\ i=i cea; / 

/ \ 2/3 

< h(a') + Pr (*') ^(V Pr (*')) + 

\ »=i t'eA< / 
= (A') + 0(ff(A') 2/3 + 1) 

< //o(T') + 0(^ D (T') 2/3 + 1) (by Theorem^ 

<Md(T*) + 0( M d(T*) 2 / 3 + 1) (by©) 

Finally, since we preprocess A using Theorem [31 the expected number of comparisons required to answer a 
query is 

Hd(R) = H(A) + 0(H(A)^ 2 + 1) 

< fi D (T*) + OUi D {T*) 2 ^ + l) 

and this completes the proof, and the paper. □ 
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